Textual Stylistic Variation: Choices, Genres and Individuals
نویسنده
چکیده
T his chapter argues for more informed target metrics for the statistical processing of stylistic variation in text collections. Much as operationalized relevance proved a useful goal to strive for in information retrieval, research in textual stylistics, whether application oriented or philologically inclined, needs goals formulated in terms of pertinence, relevance, and utility—notions that agree with reader experience of text. Differences readers are aware of are mostly based on utility—not on textual characteristics per se. Mostly, readers report stylistic differences in terms of genres. Genres, while vague and undefined, are well-established and talked about: very early on, readers learn to distinguish genres. This chapter discusses variation given by genre, and contrasts it to variation occasioned by individual choice. 1.1 Stylistic variation in text Texts are much more than what they are about. Authors make choices when they write a : they decide how to organize the material they have planned to introduce; they select amongst available synonyms and syntactic constructions; they target an intended audience for the text. Authors will make their choices in various ways and for various reasons: based on personal preferences, on their view of the reader, and on what they know and like about other similar texts. These choices are observable to the reader in the form of stylistic variation, as the difference between two ways of saying the same thing. On a surface level this variation is quite obvious, as the choice between items in a vocabulary, between types of syntactical constructions, between the various ways a Swedish Institute of Computer Science Box 1263, S–164 29 Kista, Sweden [email protected]
منابع مشابه
Conventions and Mutual Expectations — understanding sources for web genres
Genres can be understood in many different ways. They are often perceived as a primarily sociological construction, or, alternatively, as a stylostatistically observable objective characteristic of texts. The latter view is more common in the research field of information and language technology. These two views can be quite compatible and can inform each other; this present investigation discu...
متن کاملËû×× Áò×øøøùøø Óó Óñôùøøö Ëëëëòòò Ëøýðð×øøø Üôööññòø× Óö Áòòóöññøøóò Êêøöööúð Âù××× Ããöððööò
Information retrieval systems are built to handle texts as topical items: texts are tabulated by occurrence frequencies of content words in them, under the assumption that text topic is reasonably well modeled by content word occurrence. But texts have several interesting characteristics beyond topic. The experiments described in this text investigate stylistic variation. Roughly put, style is ...
متن کاملSemantic Web Tools for Categorization Greek Texts on the Internet: the MeDa13 standard and TeGO ontology
The wider question of this study is the suitability of existing Web search engines for the needs of school education. It examines the relevance to the teaching objectives of the results returned by the search process given a query and its (stated or unstated) purpose in the context of an educational activity. The particular field of teaching and research interest is Modern Greek in Cypriot seco...
متن کاملModeling intra-textual variation with entropy and surprisal: topical vs. stylistic patterns
We present a data-driven approach to investigate intra-textual variation by combining entropy and surprisal. With this approach we detect linguistic variation based on phrasal lexico-grammatical patterns across sections of research articles. Entropy is used to detect patterns typical of specific sections. Surprisal is used to differentiate between more and less informationally-loaded patterns a...
متن کاملVisualizing Stylistic Variation
Texts vary not only by topic, but by style; indeed, often the variation between texts ‘about the same thing’ can be just as noticeable as the variation between texts ‘about different things’. Some facets of this variation are quite easy to detect, and quite predictable when applied to categorization of texts by genre, functional style, or tentatively quality. Making use of such variation in an ...
متن کامل